Abstract. Let $M_n$ be an $n \times n$ Wigner or sample covariance random matrix, and let $\mu_1(M_n), \mu_2(M_n), \ldots, \mu_n(M_n)$ denote the unordered eigenvalues of $M_n$. We study the fluctuations of the partial linear eigenvalue statistics

$$\sum_{i=1}^{n-k} f(\mu_i(M_n))$$

as $n \to \infty$ for sufficiently nice test functions $f$. We consider both the case when $k$ is fixed and when $\min\{k, n-k\}$ tends to infinity with $n$.



1. Introduction 

We consider two classic random matrix ensembles with independent entries. 
1.1. Wigner Random Matrices. 

Definition 1 (Wigner random matrix). We say $M_n = \frac{1}{\sqrt{n}} W_n = \frac{1}{\sqrt{n}} (w_{nij})_{1 \le i,j \le n}$ is a real symmetric (Hermitian) Wigner matrix of size $n$ if $M_n$ is an $n \times n$ real symmetric (Hermitian) matrix that satisfies the following.

(i) $\{w_{nij} : 1 \le i \le j \le n\}$ is a collection of independent random variables,

(ii) for $1 \le i < j \le n$, $w_{nij}$ has zero mean and unit variance,

(iii) for $1 \le i \le n$, $w_{nii}$ has zero mean and variance $\sigma^2$.

For a Wigner matrix $M_n$ of size $n$, we let $\lambda_1(M_n) \le \lambda_2(M_n) \le \cdots \le \lambda_n(M_n)$ denote the ordered eigenvalues of $M_n$ and let $\mu_1(M_n), \mu_2(M_n), \ldots, \mu_n(M_n)$ denote the unordered eigenvalues of $M_n$. That is, $\mu_i(M_n) = \lambda_{\pi(i)}(M_n)$ for $1 \le i \le n$, where $\pi$ is a random permutation on $\{1, 2, \ldots, n\}$, chosen uniformly, independent of $M_n$.
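The definition can be made concrete with a small numerical sketch. The code below is only an illustration under assumptions that are not part of the text: the entries are taken to be Gaussian, the diagonal variance defaults to 2, and the function names `sample_wigner` and `ordered_and_unordered_eigenvalues` are hypothetical.

```python
import numpy as np

def sample_wigner(n, sigma=np.sqrt(2.0), rng=None):
    """Return M_n = W_n / sqrt(n) for a real symmetric Wigner matrix W_n.

    Assumption for illustration: Gaussian entries. Off-diagonal entries have
    mean 0 and variance 1; diagonal entries have mean 0 and variance sigma**2.
    """
    rng = np.random.default_rng() if rng is None else rng
    upper = np.triu(rng.standard_normal((n, n)), k=1)  # entries with i < j
    diag = np.diag(sigma * rng.standard_normal(n))      # entries with i = j
    w = upper + upper.T + diag
    return w / np.sqrt(n)

def ordered_and_unordered_eigenvalues(m, rng=None):
    """Return (lambda, mu): ascending eigenvalues and a uniform random reordering."""
    rng = np.random.default_rng() if rng is None else rng
    lam = np.linalg.eigvalsh(m)      # ascending order
    pi = rng.permutation(lam.size)   # uniform permutation, drawn independently of m
    return lam, lam[pi]

rng = np.random.default_rng(0)
M = sample_wigner(200, rng=rng)
lam, mu = ordered_and_unordered_eigenvalues(M, rng=rng)
```

Here `mu` plays the role of $\mu_1(M_n), \ldots, \mu_n(M_n)$: the same multiset as `lam`, listed in a uniformly random order.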

We will be interested in sequences of Wigner random matrices $\{M_n\}_{n \ge 1}$ that satisfy the following condition.

Definition 2 (Condition C0). For each $n \ge 1$, let $M_n$ be a real symmetric (Hermitian) Wigner matrix of size $n$. We say the sequence $\{M_n\}_{n \ge 1}$ satisfies condition C0 with exponent $p \ge 0$ if there exists $\varepsilon > 0$ such that



lim n p/2 1 ^ y ... 

l<i<j<n 



nij L {\w ni j\>n 1 / 2 ~ E } 



where $\mathbf{1}_E$ denotes the indicator function of the event $E$.

Remark 3. For each $n \ge 1$, let $M_n = \frac{1}{\sqrt{n}} W_n = \frac{1}{\sqrt{n}} (w_{nij})_{1 \le i,j \le n}$ be a real symmetric (Hermitian) Wigner matrix of size $n$. We note that if there exists $\delta > 0$ such that

$$\sup_{n \ge 1,\, 1 \le i < j \le n} \mathbb{E}|w_{nij}|^{4+p+\delta} < \infty \quad \text{and} \quad \sup_{n \ge 1,\, 1 \le i \le n} \mathbb{E}|w_{nii}|^{2+p+\delta} < \infty,$$

then $\{M_n\}_{n \ge 1}$ satisfies condition C0 with exponent $p$. We will mostly be interested in the cases when $p = 0, 1$.

1.2. Sample Covariance Random Matrices. 

Definition 4 (Sample covariance matrix). Let $A_n = \frac{1}{n} X_n^* X_n$ be an $n \times n$ matrix where $X_n = (x_{nij})_{1 \le i,j \le n}$. We say that $A_n$ is a real (complex) sample covariance matrix of size $n$ if $\{x_{nij} : 1 \le i,j \le n\}$ is a collection of real (complex) independent random variables each with zero mean and unit variance. In the complex case, we also require $\mathbb{E}(x_{nij}^2) = 0$ for all $1 \le i,j \le n$.

For a sample covariance matrix $A_n$ of size $n$, we let $\lambda_1(A_n) \le \lambda_2(A_n) \le \cdots \le \lambda_n(A_n)$ denote the ordered eigenvalues of $A_n$ and let $\mu_1(A_n), \mu_2(A_n), \ldots, \mu_n(A_n)$ denote the unordered eigenvalues of $A_n$.
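A minimal sketch of this construction in the real case follows. It assumes standard Gaussian entries purely for illustration (the definition only asks for zero mean and unit variance), and the function name `sample_covariance` is hypothetical.

```python
import numpy as np

def sample_covariance(n, rng=None):
    """Return A_n = X_n^T X_n / n for an n x n matrix X_n.

    Assumption for illustration: i.i.d. standard Gaussian entries (real case).
    """
    rng = np.random.default_rng() if rng is None else rng
    x = rng.standard_normal((n, n))
    return x.T @ x / n

rng = np.random.default_rng(1)
A = sample_covariance(150, rng=rng)
lam = np.linalg.eigvalsh(A)   # ordered eigenvalues, all nonnegative up to rounding
mu = rng.permutation(lam)     # unordered eigenvalues: a uniform random reordering
```

Since $A_n$ is positive semidefinite, its eigenvalues are nonnegative, and their sum equals the trace of $A_n$.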

We will be interested in sequences of sample covariance matrices $\{A_n\}_{n \ge 1}$ that satisfy the following condition.

Definition 5 (Condition C1). For each $n \ge 1$, let $A_n = \frac{1}{n} X_n^* X_n$ be a real (complex) sample covariance matrix of size $n$ where $X_n = (x_{nij})_{1 \le i,j \le n}$. We say the sequence $\{A_n\}_{n \ge 1}$ satisfies condition C1 if the random variables $x_{nij}$, $1 \le i,j \le n$, have symmetric distribution for all $n \ge 1$ and there exists a constant $C_1$ such that

$$\sup_{n \ge 1,\, 1 \le i,j \le n} \mathbb{E}|x_{nij}|^p \le (C_1 \sqrt{p})^p \quad \text{for all } p \ge 1.$$

1.3. Known Results. For a Hermitian $n \times n$ matrix $B$, the empirical spectral distribution (ESD) $F^B(x)$ of $B$ is given by

$$F^B(x) := \frac{1}{n} \# \{1 \le i \le n : \lambda_i(B) \le x\},$$

where $\lambda_1(B), \lambda_2(B), \ldots, \lambda_n(B)$ denote the eigenvalues of $B$. Here $\#S$ denotes the cardinality of the set $S$.

A fundamental problem in random matrix theory is to determine the limiting distribution of the ESD as the size of the matrix tends to infinity. In the 1950s, Wigner studied the limiting ESD for a large class of random Hermitian matrices whose entries on or above the diagonal are independent [30]. Under certain conditions, Wigner showed that the ESD of such a matrix converges to the semicircle law $F$ with density given by

$$\rho(x) := \begin{cases} \frac{1}{2\pi} \sqrt{4 - x^2}, & |x| \le 2, \\ 0, & \text{otherwise.} \end{cases} \tag{1}$$

The most general form of Wigner's semicircle law assumes only the first two moments of the entries [3, Theorem 2.9].

Theorem 6 (Wigner's semicircle law). For each $n \ge 1$, let $M_n = \frac{1}{\sqrt{n}} W_n = \frac{1}{\sqrt{n}} (w_{nij})_{1 \le i,j \le n}$ be a real symmetric (Hermitian) Wigner matrix of size $n$. Assume that for any $\varepsilon > 0$,

$$\lim_{n \to \infty} \frac{1}{n^2} \sum_{i,j=1}^{n} \mathbb{E}\left[ |w_{nij}|^2 \mathbf{1}_{\{|w_{nij}| > \varepsilon \sqrt{n}\}} \right] = 0.$$

Then the ESD of $M_n$ converges to the semicircle law $F$ with density $\rho$ defined in (1), almost surely as $n \to \infty$. Equivalently, for any continuous, bounded function $f$,

$$\frac{1}{n} \sum_{i=1}^{n} f(\lambda_i(M_n)) \longrightarrow \int_{-\infty}^{\infty} f(x) \rho(x)\, dx$$

almost surely as $n \to \infty$.
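The convergence in Theorem 6 can be illustrated numerically. The sketch below is an illustration only, under assumptions not taken from the text: Gaussian entries, the unbounded test function $f(x) = x^2$ (whose integral against the semicircle density equals 1), and hypothetical function names.

```python
import numpy as np

def semicircle_density(x):
    """Semicircle density sqrt(4 - x^2) / (2 pi) on [-2, 2], zero elsewhere."""
    x = np.asarray(x, dtype=float)
    return np.sqrt(np.clip(4.0 - x**2, 0.0, None)) / (2.0 * np.pi)

def semicircle_integral(f, num=200_001):
    """Trapezoidal approximation of the integral of f(x) * rho(x) over [-2, 2]."""
    x = np.linspace(-2.0, 2.0, num)
    y = f(x) * semicircle_density(x)
    dx = x[1] - x[0]
    return float(dx * (y.sum() - 0.5 * (y[0] + y[-1])))

def empirical_average(f, n, rng):
    """Return (1/n) * sum_i f(lambda_i(M_n)) for one Gaussian Wigner matrix."""
    g = rng.standard_normal((n, n))
    w = (g + g.T) / np.sqrt(2.0)  # off-diagonal variance 1, diagonal variance 2
    lam = np.linalg.eigvalsh(w / np.sqrt(n))
    return float(np.mean(f(lam)))

f = lambda x: x**2
limit = semicircle_integral(f)  # second moment of the semicircle law
approx = empirical_average(f, 400, np.random.default_rng(2))
```

For moderate $n$ the two numbers should already agree to a couple of decimal places, which reflects the law-of-large-numbers character of the theorem.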

The sample covariance case was studied by Marchenko and Pastur [23]. In particular, they showed that, under certain conditions, the ESD of a sample covariance random matrix converges to $F_{MP}$ with density given by

$$\rho_{MP}(x) := \begin{cases} \frac{1}{2\pi x} \sqrt{x(4 - x)}, & 0 < x \le 4, \\ 0, & \text{otherwise.} \end{cases} \tag{2}$$

The Marchenko-Pastur law is the limiting ESD for a large class of sample covariance random matrices [3, Theorem 3.10].

Theorem 7 (Marchenko-Pastur law). For each $n \ge 1$, let $A_n = \frac{1}{n} X_n^* X_n$ be a real (complex) sample covariance matrix of size $n$, where $X_n = (x_{nij})_{1 \le i,j \le n}$. Assume that for any $\varepsilon > 0$,

$$\lim_{n \to \infty} \frac{1}{n^2} \sum_{i,j=1}^{n} \mathbb{E}\left[ |x_{nij}|^2 \mathbf{1}_{\{|x_{nij}| > \varepsilon \sqrt{n}\}} \right] = 0.$$

Then the ESD of $A_n$ converges to the Marchenko-Pastur law $F_{MP}$ almost surely as $n \to \infty$. Equivalently, for any continuous, bounded function $f$,

$$\frac{1}{n} \sum_{i=1}^{n} f(\lambda_i(A_n)) \longrightarrow \int f(x) \rho_{MP}(x)\, dx$$

almost surely as $n \to \infty$.

Theorems 6 and 7 can be viewed as random matrix theory analogues of the Law of Large Numbers from classical probability theory. Thus a Central Limit Theorem for fluctuations of linear eigenvalue statistics is a natural next step.

In [24], Shcherbina studies the fluctuations of linear eigenvalue statistics for both Wigner and sample covariance random matrices. In particular, she considers test functions $f$ from the space $\mathcal{H}_s$ with the norm

$$\|f\|_s^2 := \int (1 + 2|l|)^{2s} |\hat{f}(l)|^2\, dl$$

for $s > 3/2$, where $\hat{f}$ is the Fourier transform of $f$ defined by

$$\hat{f}(l) := \frac{1}{\sqrt{2\pi}} \int e^{ilx} f(x)\, dx.$$

We note that if $f$ is a real-valued function with $f \in \mathcal{H}_s$ for $s > 3/2$, then both $f$ and $f'$ are continuous and bounded almost everywhere [17]. In particular, this implies that $f$ is Lipschitz.

Shcherbina obtains the following two results [24, Theorems 1 and 2].

Theorem 8 (Linear eigenvalue statistics for Wigner matrices; [24]). For each $n \ge 1$, let $M_n = \frac{1}{\sqrt{n}} W_n = \frac{1}{\sqrt{n}} (w_{nij})_{1 \le i,j \le n}$ be a real symmetric Wigner matrix of size






S. O'ROURKE AND A. SOSHNIKOV 



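$n$. Suppose $\mathbb{E}[w_{nij}^4] = m_4$ for all $1 \le i < j \le n$ and all $n \ge 1$. Assume for any $\varepsilon > 0$,

$$\lim_{n \to \infty} \left( \frac{1}{n} \sum_{i=1}^{n} \mathbb{E}|w_{nii}|^2 \mathbf{1}_{\{|w_{nii}| > \varepsilon \sqrt{n}\}} + \frac{1}{n^2} \sum_{1 \le i < j \le n} \mathbb{E}|w_{nij}|^4 \mathbf{1}_{\{|w_{nij}| > \varepsilon \sqrt{n}\}} \right) = 0.$$

Let $f$ be a real-valued function with $\|f\|_s < \infty$ for some $s > 3/2$. Then

$$\sum_{i=1}^{n} f(\lambda_i(M_n)) - \mathbb{E} \sum_{i=1}^{n} f(\lambda_i(M_n)) \longrightarrow N(0, v^2[f])$$

in distribution as $n \to \infty$, where

$$v^2[f] := \frac{1}{2\pi^2} \int_{-2}^{2} \int_{-2}^{2} \left( \frac{f(x) - f(y)}{x - y} \right)^2 \frac{4 - xy}{\sqrt{4 - x^2} \sqrt{4 - y^2}}\, dx\, dy + \frac{m_4 - 3}{2\pi^2} \left( \int_{-2}^{2} f(x) \frac{2 - x^2}{\sqrt{4 - x^2}}\, dx \right)^2 + \frac{\sigma^2 - 2}{4\pi^2} \left( \int_{-2}^{2} \frac{f(x)\, x}{\sqrt{4 - x^2}}\, dx \right)^2. \tag{3}$$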
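To get a feel for why the fluctuations in Theorem 8 need no normalization, it may help to look at the simplest linear statistic. The computation below is only an illustration: the test function $f(x) = x$ is unbounded and hence outside the scope of the theorem, but for it the variance can be computed exactly from Definition 1.

```latex
\begin{align*}
\sum_{i=1}^{n} \lambda_i(M_n) &= \operatorname{tr} M_n = \frac{1}{\sqrt{n}} \sum_{i=1}^{n} w_{nii}, \\
\operatorname{Var}\Bigl( \sum_{i=1}^{n} \lambda_i(M_n) \Bigr) &= \frac{1}{n} \sum_{i=1}^{n} \operatorname{Var}(w_{nii}) = \sigma^2 .
\end{align*}
```

The variance stays equal to $\sigma^2$ for every $n$, whereas a sum of $n$ independent terms with variance of order one would have variance of order $n$.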



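Theorem 9 (Linear eigenvalue statistics for sample covariance matrices; [24]). For each $n \ge 1$, let $A_n = \frac{1}{n} X_n^* X_n$ be a real sample covariance matrix of size $n$, where $X_n = (x_{nij})_{1 \le i,j \le n}$. Suppose $\mathbb{E}[x_{nij}^4] = m_4$ for all $1 \le i,j \le n$ and all $n \ge 1$. Assume there exists $\varepsilon > 0$ such that

$$\sup_{n \ge 1}\, \sup_{1 \le i,j \le n} \mathbb{E}|x_{nij}|^{4+\varepsilon} < \infty.$$

Let $f$ be a real-valued function with $\|f\|_s < \infty$ for some $s > 3/2$. Then

$$\sum_{i=1}^{n} f(\lambda_i(A_n)) - \mathbb{E} \sum_{i=1}^{n} f(\lambda_i(A_n)) \longrightarrow N(0, v_{sc}^2[f])$$

in distribution as $n \to \infty$, where

$$v_{sc}^2[f] := \frac{1}{2\pi^2} \int_{0}^{4} \int_{0}^{4} \left( \frac{f(x) - f(y)}{x - y} \right)^2 \frac{4 - (x-2)(y-2)}{\sqrt{4 - (x-2)^2} \sqrt{4 - (y-2)^2}}\, dx\, dy + \frac{m_4 - 3}{4\pi^2} \left( \int_{0}^{4} f(x) \frac{x - 2}{\sqrt{4 - (x-2)^2}}\, dx \right)^2. \tag{4}$$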



Analogous results for other random matrix ensembles (and other classes of test 
functions /) have also been obtained; see for example p7l [4l [9l ITOl IT9l [22l [24l [25l 
l26l l28l and references therein. 



1.4. Main Results. One can observe from Theorems 8 and 9 that the variance of the linear eigenvalue statistics does not grow to infinity in the limit $n \to \infty$ for sufficiently smooth test functions. This points to very effective cancellations between different terms of the sum and a rigidity property for the distribution of the eigenvalues.

In [18], K. Johansson considered, among other things, linear statistics

$$S_n = \sum_{j=1}^{n} g(\theta_j) \tag{5}$$

of the eigenvalues $e^{i\theta_1}, \ldots, e^{i\theta_n}$ of a random $n \times n$ unitary matrix distributed according to Haar measure on $U(n)$. He proved the CLT for such linear statistics under the optimal condition

$$\sum_{k=-\infty}^{\infty} |k|\, |\hat{g}_k|^2 < \infty,$$

where $\hat{g}_k$ are the Fourier coefficients of $g$, and also connected the CLT result to
the Szego asymptotic formula for Toeplitz determinants (see e.g. [TH [SUl 
Johansson's proof relies on elaborate cancellations. In particular, it works with minor modifications for more general $\beta$ ensembles, $\beta > 0$ (the Haar measure case corresponding to $\beta = 2$).

In Remark 2.1 of [18], Johansson noted that for $S_{n,1} = \sin\theta_1 + \cdots + \sin\theta_{n-1}$, the sum of the first $n - 1$ terms in (5) with $g(\theta) = \sin\theta$, the distribution of the normalized statistic

$$\frac{S_{n,1} - \mathbb{E} S_{n,1}}{\sqrt{\operatorname{Var} S_{n,1}}}$$

does not converge to the standard normal distribution as $n \to \infty$. The argument relies on the CLT for $S_n$ and the following facts: $\operatorname{Var} S_n = 1/2$, $\operatorname{Var} S_{n,1} \to 1$ as $n \to \infty$, and $|S_n - S_{n,1}| \le 1$.
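The variance facts quoted above can be checked by a short computation. The sketch below assumes, as is implicit in the remark, that the eigenangles $\theta_1, \ldots, \theta_n$ are listed in an exchangeable order, so that each $\theta_j$ is uniformly distributed on the circle; it takes $\operatorname{Var} S_n = 1/2$ as given.

```latex
\begin{align*}
\operatorname{Var}(\sin\theta_n) &= \frac{1}{2}, \\
\operatorname{Cov}(S_n, \sin\theta_n) &= \frac{1}{n} \sum_{j=1}^{n} \operatorname{Cov}(S_n, \sin\theta_j) = \frac{\operatorname{Var} S_n}{n} = \frac{1}{2n}, \\
\operatorname{Var} S_{n,1} &= \operatorname{Var} S_n + \operatorname{Var}(\sin\theta_n) - 2 \operatorname{Cov}(S_n, \sin\theta_n) = 1 - \frac{1}{n}.
\end{align*}
```

So removing a single term doubles the limiting variance, from $1/2$ to $1$. A standard normal limit for the normalized $S_{n,1}$ would then be hard to reconcile with the Gaussian limit of variance $1/2$ for $S_n$, since the two statistics differ by at most 1.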

In this paper, we study the fluctuations of the partial linear eigenvalue statistics

$$S_{n,k}[f] := \sum_{i=1}^{n-k} f(\mu_i(M_n)) \tag{6}$$

where $k = k(n)$ is a positive integer sequence, $f$ is a sufficiently nice test function from $\mathcal{H}_s$, $s > 3/2$, and $\{M_n\}_{n \ge 1}$ is a sequence of Wigner matrices that satisfy condition C0.

Theorem 10. For each $n \ge 1$, let $M_n = \frac{1}{\sqrt{n}} W_n = \frac{1}{\sqrt{n}} (w_{nij})_{1 \le i,j \le n}$ be a real symmetric Wigner matrix of size $n$. Assume the sequence $\{M_n\}_{n \ge 1}$ satisfies condition C0 with exponent 0 and suppose $\mathbb{E}[w_{nij}^4] = m_4$ for all $1 \le i < j \le n$ and all $n \ge 1$. Let $f$ be a real-valued, bounded Lipschitz function with $\|f\|_s < \infty$ for some $s > 3/2$. Let $k$ be a fixed positive integer and let $S_{n,k}[f]$ be defined by (6). Then

$$S_{n,k}[f] - \mathbb{E} S_{n,k}[f] \longrightarrow N(0, v^2[f]) * \left( -\sum_{i=1}^{k} \left[ f(\psi_i) - \mathbb{E} f(\psi_i) \right] \right)$$

in distribution as $n \to \infty$, where $\psi_1, \ldots, \psi_k$ are i.i.d. semicircle-distributed random variables and $v^2[f]$ is given in (3).

Theorem 11. For each $n \ge 1$, let $M_n = \frac{1}{\sqrt{n}} W_n = \frac{1}{\sqrt{n}} (w_{nij})_{1 \le i,j \le n}$ be a real symmetric Wigner matrix of size $n$. Assume the sequence $\{M_n\}_{n \ge 1}$ satisfies condition C0 with exponent 1 and suppose $\mathbb{E}[w_{nij}^4] = m_4$ for all $1 \le i < j \le n$ and all $n \ge 1$. Let $f$ be a real-valued, bounded Lipschitz function with $\|f\|_s < \infty$ for some $s > 3/2$. Let $k = k(n)$ be a positive integer sequence such that $\min\{k, n - k\} \to \infty$ as $n \to \infty$. Let $S_{n,k}[f]$ be defined by (6). Then

$$a_{n,k} \left( S_{n,k}[f] - \mathbb{E} S_{n,k}[f] \right) \longrightarrow N(0, d^2[f])$$

in distribution as $n \to \infty$, where

$$a_{n,k} := \sqrt{\frac{n}{k(n-k)}} \tag{7}$$

and $d^2[f] := \operatorname{Var}[f(\psi)]$ for a semicircle-distributed random variable $\psi$.

Remark 12. One can also study the case when $n - k$ is a fixed positive integer. In the proof of Theorem 10 below, we show that if $l$ is a fixed positive integer, then

$$\sum_{i=1}^{l} f(\mu_i(M_n)) \longrightarrow \sum_{i=1}^{l} f(\psi_i)$$

in distribution as $n \to \infty$, where $\psi_1, \ldots, \psi_l$ are i.i.d. semicircle-distributed random variables.

It should be mentioned that a different type of partial linear eigenvalue statistic for Wigner matrices has been recently studied by Bao, Pan, and Zhou in [5]. In particular, they consider $\sum_{i=1}^{k} f(\lambda_i(M_n))$, where $\lambda_1(M_n) \le \lambda_2(M_n) \le \cdots \le \lambda_n(M_n)$ are the ordered eigenvalues of $M_n$ and $k$ is proportional to $n$.

Now we turn our attention to sample covariance random matrices. Let $\{A_n\}_{n \ge 1}$ be a sequence of sample covariance matrices that satisfies condition C1. In this case, we consider the partial linear eigenvalue statistics

$$T_{n,k}[f] := \sum_{i=1}^{n-k} f(\mu_i(A_n)). \tag{8}$$

Theorem 13. For each $n \ge 1$, let $A_n = \frac{1}{n} X_n^* X_n$ be a real sample covariance matrix of size $n$ where $X_n = (x_{nij})_{1 \le i,j \le n}$. Assume the sequence $\{A_n\}_{n \ge 1}$ satisfies condition C1 and suppose $\mathbb{E}[x_{nij}^4] = m_4$ for all $1 \le i,j \le n$ and all $n \ge 1$. Let $f$ be a real-valued, bounded Lipschitz function with $\|f\|_s < \infty$ for some $s > 3/2$. Let $k$ be a fixed positive integer and let $T_{n,k}[f]$ be given by (8). Then

$$T_{n,k}[f] - \mathbb{E} T_{n,k}[f] \longrightarrow N(0, v_{sc}^2[f]) * \left( -\sum_{i=1}^{k} \left[ f(\psi_i) - \mathbb{E} f(\psi_i) \right] \right)$$

in distribution as $n \to \infty$, where $\psi_1, \ldots, \psi_k$ are i.i.d. Marchenko-Pastur distributed random variables and $v_{sc}^2[f]$ is given by (4).

Theorem 14. For each $n \ge 1$, let $A_n = \frac{1}{n} X_n^* X_n$ be a real sample covariance matrix of size $n$ where $X_n = (x_{nij})_{1 \le i,j \le n}$. Assume the sequence $\{A_n\}_{n \ge 1}$ satisfies condition C1 and suppose $\mathbb{E}[x_{nij}^4] = m_4$ for all $1 \le i,j \le n$ and all $n \ge 1$. Let $f$ be a real-valued, Lipschitz function with $\|f\|_s < \infty$ for some $s > 3/2$. Let $k = k(n)$ be a positive integer sequence such that $\min\{k, n - k\} \to \infty$ as $n \to \infty$. Let $T_{n,k}[f]$ be given by (8). Then

$$a_{n,k} \left( T_{n,k}[f] - \mathbb{E} T_{n,k}[f] \right) \longrightarrow N(0, d_{sc}^2[f])$$

in distribution as $n \to \infty$, where $a_{n,k}$ is defined in (7) and $d_{sc}^2[f] := \operatorname{Var}[f(\psi)]$ for a Marchenko-Pastur distributed random variable $\psi$.

The last two theorems are valid under assumptions weaker than condition C1, since one can derive the local Marchenko-Pastur law¹ at the optimal scale under assumptions analogous to those in condition C0.

¹ The conclusion of the Marchenko-Pastur law (Theorem 7) can be equivalently stated as

$$\frac{1}{n} \# \{1 \le i \le n : \lambda_i(A_n) \in I\} \longrightarrow \int_I \rho_{MP}(x)\, dx$$

almost surely as $n \to \infty$, for any fixed interval $I$. The local Marchenko-Pastur law refers to a similar conclusion holding when the interval $I$ is allowed to change with $n$. Of particular interest
1.5. Notation and Overview. Asymptotic notations such as $O$, $o$, $\Omega$, and so forth, are used under the assumption that $n \to \infty$. The notation $O_C(\cdot)$ emphasizes that the hidden constant depends on $C$.

An event $E$, which depends on $n$, is said to hold with overwhelming probability if $\mathbb{P}(E) \ge 1 - O_C(n^{-C})$ for every constant $C > 0$. We let $E^c$ denote the complement of the event $E$.

The paper is organized as follows. In Section 2, we prove Theorems 10 and 11. Section 3 is devoted to Theorems 13 and 14.

1.6. Acknowledgements. The authors would like to thank Persi Diaconis and 
Laszlo Erdos for useful comments. We also thank the anonymous referee for many 
helpful comments, corrections, and references. A.S. has been supported in part by 
the NSF grant DMS-1007558. S.O. has been supported by grant AFOSAR-FA- 
9550-12-1-0083. 

2. Proof of Theorems 10 and 11

In order to study the limiting distribution of $S_{n,k}[f] - \mathbb{E} S_{n,k}[f]$, we let $(\xi_1, \ldots, \xi_k)$ be a random sample without replacement from $\{1, 2, \ldots, n\}$ independent of $M_n$. Then

$$S_{n,k}[f] \overset{d}{=} \sum_{i=1}^{n} f(\lambda_i(M_n)) - \sum_{j=1}^{k} f(\lambda_{\xi_j}(M_n)) = L_n[f] - \sum_{j=1}^{k} f(\lambda_{\xi_j}(M_n)),$$

where

$$L_n[f] := \operatorname{tr}[f(M_n)].$$
We now take advantage of the following rigidity result based on pj] Theorem 2.2] 
and [21, Theorem 3.6]. Let $\eta_j = \eta_j^{(n)}$ be the classical location of the $j$th eigenvalue. That is,

$$\int_{-\infty}^{\eta_j} \rho(x)\, dx = \frac{j}{n},$$

where $\rho$ is the density of the semicircle distribution given in (1).
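The classical locations can be computed numerically. The sketch below assumes the semicircle density $\rho(x) = \frac{1}{2\pi}\sqrt{4 - x^2}$ on $[-2, 2]$, uses its closed-form distribution function, and solves the defining equation by bisection; the function names are hypothetical.

```python
import math

def semicircle_cdf(x):
    """Distribution function of the semicircle law supported on [-2, 2]."""
    if x <= -2.0:
        return 0.0
    if x >= 2.0:
        return 1.0
    return 0.5 + x * math.sqrt(4.0 - x * x) / (4.0 * math.pi) + math.asin(x / 2.0) / math.pi

def classical_location(j, n, tol=1e-12):
    """Solve F(eta_j) = j / n for eta_j by bisection on [-2, 2]."""
    target = j / n
    lo, hi = -2.0, 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if semicircle_cdf(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

n = 10
eta = [classical_location(j, n) for j in range(1, n + 1)]  # eta[j-1] is eta_j
```

By symmetry of the semicircle law, $\eta_{n-j} = -\eta_j$, and the largest classical location $\eta_n$ sits at the spectral edge 2.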

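Theorem 15 (Rigidity of eigenvalues). Let $M_n = \frac{1}{\sqrt{n}} W_n = \frac{1}{\sqrt{n}} (w_{ij})_{1 \le i,j \le n}$ be a real symmetric Wigner matrix of size $n$. Assume there exists a constant $C_1$ such that

$$\sup_{1 \le i < j \le n} \mathbb{E}[w_{ij}^4] \le C_1. \tag{9}$$

Then for any $0 < \varepsilon < 1/2$, there exist constants $C, c > 0$ and $n_0$ (depending only on $C_1$, $\varepsilon$, and $\sigma$ from Definition 1) such that the event

$$\left\{ \exists j : |\lambda_j(M_n) - \eta_j| > (\log n)^{c \log\log n}\, n^{-2/3} \left[ \min\{j, n - j + 1\} \right]^{-1/3} \right\} \tag{10}$$

holds with probability at most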




i=l 



for any $n > n_0$, where the event $\Omega_n$ holds with overwhelming probability.



is the case when the length of the interval decreases as n tends to infinity; see for instance [8] and 
references therein. 
The proof of Theorem 15 is based on the machinery developed in [321 EI]; we present the proof in Appendix A. For the moment, we assume Theorem 15 and complete the proof of Theorems 10 and 11.

It follows from Theorem 15 that

tu) twi-E^i^ ^H 

i=i 3=1 v J 

with probability 1 — o(l). Here we have used the fact that / is Lipschitz. 

Proof of Theorem 10. From (11), we have that

k k 

Ln[f] ~ f(kA M n)) = L n {f] - E + OfM 

3=1 i=i 

with probability $1 - o(1)$. We note that $L_n[f]$ and $f(\eta_{\xi_j})$ are independent.

It also follows from Theorem 15 that

$$\mathbb{E} S_{n,k}[f] = \mathbb{E} L_n[f] - \mathbb{E} \sum_{j=1}^{k} f(\eta_{\xi_j}) + o_f(1)$$

since $f$ is bounded.

By Theorem 8, it follows that $L_n[f] - \mathbb{E} L_n[f]$ converges to a normal distribution with mean zero and variance $v^2[f]$. It remains to compute the limiting distribution of

$$\sum_{j=1}^{k} f(\eta_{\xi_j}) - \mathbb{E} \sum_{j=1}^{k} f(\eta_{\xi_j}).$$

Let £i, • • • be i.i.d. uniform random variables on {1, 2, . . . ,n} independent of 
M n . We begin by noting that 

k k 
3 = 1 3=1 

Let g be an arbitrary bounded, continuous function. Then 

%(e/(%)]= E Je/(%) 

\ J=l / »!,...** distinct V j=l 



n(n - 1) • • • (n- fc+ 1) 

£ »(£/<%)) 

distinct \.7=1 / 



*1 »*=1 \j=l 

A' 



% e/(^) (y 



Therefore the limiting distribution of X^=i/( r fo) ^ s the same as the limiting 
distribution of X)j=i fivg)- A simple computation reveals that ry|. converges to 
the semicircle distribution as $n \to \infty$. Since $f$ is continuous and bounded on $[-2, 2]$, and $k$ is fixed, the proof of Theorem 10 is complete. □

Proof of Theorem 11. Suppose $\min\{k, n - k\} \to \infty$ as $n \to \infty$. We begin by noting that

$$S_{n,k}[f] \overset{d}{=} \sum_{i=1}^{n-k} f(\lambda_{\xi_i}(M_n)) \overset{d}{=} L_n[f] - \sum_{i=1}^{k} f(\lambda_{\xi_i}(M_n)),$$

where $(\xi_1, \xi_2, \ldots, \xi_n)$ is a random sample without replacement from $\{1, 2, \ldots, n\}$ independent of $M_n$.

Since $a_{n,k} = o(1)$, it follows from Theorem 8 that

$$a_{n,k} \left( L_n[f] - \mathbb{E} L_n[f] \right) \longrightarrow 0$$

in probability as $n \to \infty$. Therefore, it suffices to show that

$$a_{n,k} \sum_{i=1}^{k} \left[ f(\lambda_{\xi_i}(M_n)) - \mathbb{E} f(\lambda_{\xi_i}(M_n)) \right] \longrightarrow N(0, d^2[f]) \tag{12}$$

in distribution as $n \to \infty$ or

$$a_{n,k} \sum_{i=1}^{n-k} \left[ f(\lambda_{\xi_i}(M_n)) - \mathbb{E} f(\lambda_{\xi_i}(M_n)) \right] \longrightarrow N(0, d^2[f]) \tag{13}$$

in distribution as $n \to \infty$.

We will verify (12) when $k \le n - k$ and verify (13) in the case when $k > n - k$. In the setting where the sequence $\{k(n)\}_{n \ge 1}$ alternates between the two cases, we use a subsequence argument since the limit in each case will be the same.

Since the argument is the same in each case, we assume $k \le n - k$ and verify (12). In this case, $a_{n,k} = O(k^{-1/2})$. From condition C0, we find that the event (10) from Theorem 15 holds with probability $o(n^{-1/2})$. Since $f$ is bounded, it follows that

$$a_{n,k}\, \mathbb{E} \sum_{i=1}^{k} f(\lambda_{\xi_i}(M_n)) = a_{n,k}\, \mathbb{E} \sum_{i=1}^{k} f(\eta_{\xi_i}) + o_f(1). \tag{14}$$

Therefore, by (11) and (14), it suffices to show that

$$a_{n,k} \sum_{i=1}^{k} \left[ f(\eta_{\xi_i}) - \mathbb{E} f(\eta_{\xi_i}) \right] \longrightarrow N(0, d^2[f]) \tag{15}$$

in distribution as $n \to \infty$. (15) will follow from Lemma 16 below. Indeed, since $f$ is bounded, a simple computation reveals that

$$\operatorname{Var}[f(\eta_{\xi_1})] \longrightarrow \operatorname{Var}[f(\psi)]$$

as $n \to \infty$, where $\psi$ is a semicircle-distributed random variable. □

Lemma 16. For each $n \ge 1$, let $(\xi_1^{(n)}, \ldots, \xi_k^{(n)})$ be a discrete random sample on $[n] := \{1, 2, \ldots, n\}$, where $k = k(n)$ is a positive integer sequence such that $\min\{k, n - k\} \to \infty$ as $n \to \infty$. Define $\zeta_i^{(n)} := \xi_i^{(n)} / n$ for each $n \ge 1$ and $i = 1, 2, \ldots, k$. Let $f : [0, 1] \to \mathbb{R}$ be a bounded function. Then

$$a_{n,k} \sum_{i=1}^{k} \left[ f(\zeta_i^{(n)}) - \mathbb{E} f(\zeta_i^{(n)}) \right] \longrightarrow N(0, \beta^2)$$

in distribution as $n \to \infty$, where

$$\beta^2 := \lim_{n \to \infty} \operatorname{Var}[f(\zeta_1^{(n)})]$$

and $a_{n,k}$ is defined in (7).

Lemma [TO] is a direct consequence of 131 Theorem 1] (see also [T5] and [TH1 
Section 3]). For completeness we give a proof of Lemma 16 in Appendix B.
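The scaling in Lemma 16 can be checked against the classical finite-population variance formula. The sketch below is an illustration under stated assumptions: the sample is drawn without replacement, the normalization is taken to be $a_{n,k} = \sqrt{n / (k(n-k))}$ (an assumed form of (7)), the test function is $f = \cos$, and all function names are hypothetical.

```python
import numpy as np

def a_nk(n, k):
    """Normalization a_{n,k} = sqrt(n / (k (n - k))); an assumed form of (7)."""
    return np.sqrt(n / (k * (n - k)))

def exact_scaled_variance(f, n, k):
    """Exact variance of a_{n,k} * sum_{i<=k} f(xi_i / n), sampling without replacement."""
    values = f(np.arange(1, n + 1) / n)
    pop_var = values.var()                     # population variance (divides by n)
    var_sum = k * pop_var * (n - k) / (n - 1)  # finite-population variance of the sum
    return a_nk(n, k) ** 2 * var_sum

def simulated_scaled_sums(f, n, k, reps, rng):
    """Monte Carlo draws of the centered, scaled sum from Lemma 16."""
    values = f(np.arange(1, n + 1) / n)
    sums = np.array([values[rng.choice(n, size=k, replace=False)].sum()
                     for _ in range(reps)])
    return a_nk(n, k) * (sums - k * values.mean())

f = np.cos
n, k = 400, 150
exact = exact_scaled_variance(f, n, k)
sims = simulated_scaled_sums(f, n, k, reps=4000, rng=np.random.default_rng(3))
```

With this normalization the exact variance equals the population variance of $f(\zeta_1^{(n)})$ times $n/(n-1)$, which is why the limit $\beta^2$ does not depend on how $k$ grows.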

3. Proof of Theorems 13 and 14

In order to prove Theorems 13 and 14, we require a rigidity estimate for the eigenvalues of sample covariance random matrices. Theorem 17 below provides such an estimate and is similar to Theorem 15.

Let $\gamma_j = \gamma_j^{(n)}$ be the classical location of the $j$th eigenvalue. That is,

$$\int_{0}^{\gamma_j} \rho_{MP}(x)\, dx = \frac{j}{n},$$

where $\rho_{MP}$ is the density of the Marchenko-Pastur law given in (2).

Theorem 17. Let $A_n = \frac{1}{n} X_n^* X_n$ be a real (complex) sample covariance matrix of size $n$ where $X_n = (x_{ij})_{1 \le i,j \le n}$. Assume there exists a constant $C_1$ such that

$$\sup_{1 \le i,j \le n} \mathbb{E}|x_{ij}|^p \le (C_1 \sqrt{p})^p \quad \text{for all } p \ge 1 \tag{16}$$

and suppose $x_{ij}$, $1 \le i,j \le n$, have symmetric distribution. Then there exist constants $C, c, c_0, c_1 > 0$ (depending only on $C_1$) such that

$$\mathbb{P}\left( \exists j : |\lambda_j(A_n) - \gamma_j| > C (\log n)^{c \log\log n}\, n^{-2/3} \right) \le C \exp\left( -c_0 (\log n)^{c_1 \log\log n} \right)$$

for $n$ sufficiently large.

With this rigidity estimate in hand, the proof of Theorems 13 and 14 is nearly identical to the proof of Theorems 10 and 11; we leave the details to the reader. It remains to prove Theorem 17.

We will need the following version of [7, Lemma 5.1]. It should be noted that [7, Lemma 5.1] is much more general than the version stated here. For convenience, we define

$$\varphi_n := (\log n)^{\log\log n}.$$

Lemma 18 ([7]). Let $A_n = \frac{1}{n} X_n^* X_n$ be a real (complex) sample covariance matrix of size $n$ where $X_n = (x_{ij})_{1 \le i,j \le n}$. Assume there exists a constant $C_1$ such that (16) holds. Then there exist constants $C, c, c_0 > 0$ (depending only on $C_1$) such that for any $\varphi_n^C \le j \le n - \varphi_n^C$,

P ("fj-tp* - ^]( A n) < > 1 ~ ex P(- c 0<y5n) 

and 

— > rTTa ^ exp(-c y„). 

7J i(i-i) v 7 

Proof of Theorem 17. By the union bound, it suffices to show that

P (^(40 - 7il > C^n- 2 / 3 ) < Cexp(-co^) 
for each $1 \le j \le n$. Let $0 < \varepsilon < 1/100$. We consider several cases.
(i) If en < j < n — tp^, then



|A i (A„)- 7j -| < 



< 



40^ 



en^n/n) 1 / 3 en 2 / 3 

with probability at least 1 — exp(— CQip n ) by Lemma H51 
(ii) Consider the case when <p c n < j < en. Using that j < en, we have 



I 

n 



PMp{x)dx > 



2n 



V2 



Hi 



since jj < 2. Thus, we obtain the bound 



(17) 



7T 2 7 2 



Since ip^ < j < en, we combine (|17|) with Lemma [T5] to obtain 

< 2(1 _ e) i/3„2 * 2( i_ e) i/,7 

with probability at least 1 — exp(— coip n ). 

(iii) If 1 < j < ifni then Xj < 73^ with probability at least 1 — exp(— CQip n ) by 
Lemma [18] Using the bound (fTTj) for 73^ , we obtain 

|Aj(A«)-7il < 273^ <9tt 2 ^ 

with probability at least 1 — exp(— coip n ). 

(iv) Consider the final case when n — yf n < j < n. First we note that for any 

k < 3p£, 

^ > k - = J PMPix)dx > ^L- J VA^ dx 



and hence 
(18) |4- 7rl _ fe |< 



2/3 



for all k < 3<p c r , 



By Lemma IT51 and the estimate above, it follows that 

2 /3 

(!,•;- • 

(19) \ 3 (An) >7n-3 V = > 4 - 



with probability at least 1 — exp(— CQip n ). By [371 Lemma 3] and Markov's 
inequality, there exists constant C, d > such that 

"tr(A„)L« 2/3 j" 



(A n (A n ) >4 + ^n- 2 / 3 ) 



E 



< 



(4 + ^n- 2 / 3 )L« 2/3 J 



<C" 



+ 

<C'exp(-cV^). 
Combining the large deviation bound above with (fl"9)) yields 



A, ■ = 4 + O 



7 2 /3 
uniformly for all n—(p n < j < n with probability 1 — exp(— Q((p n )). Therefore,
by the triangle inequality and (fT8|) 



sup \\j-lj\=0 



,2/3 



with probability 1 — exp(— fl(ip n )). 
Since the cases above cover all $1 \le j \le n$, the proof of Theorem 17 is complete. □

Appendix A. Proof of Theorem 15

This section is devoted to the proof of Theorem 15. We will need the following version of [21, Theorem 3.6].

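Theorem 19 ([21]). Let $M_n = \frac{1}{\sqrt{n}} W_n$ be a real symmetric Wigner matrix where $W_n = (w_{ij})_{1 \le i,j \le n}$. Suppose there exist constants $C_1, c_1 > 0$ and $0 < \varepsilon < 1/2$ such that

$$\sup_{1 \le i < j \le n} \mathbb{E}[w_{ij}^4] \le C_1 \quad \text{and} \quad \sup_{1 \le i \le j \le n} \mathbb{P}\left( |w_{ij}| > n^{1/2 - \varepsilon} \right) \le e^{-n^{c_1}}.$$

Then there exist constants $c > 0$ and $n_0$ (which depend only on $C_1$, $\varepsilon$, and $\sigma$ from Definition 1) such that the event

$$\bigcap_{j=1}^{n} \left\{ |\lambda_j(M_n) - \eta_j| \le (\log n)^{c \log\log n}\, n^{-2/3} \left[ \min\{j, n - j + 1\} \right]^{-1/3} \right\}$$

holds with overwhelming probability for any $n > n_0$.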

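Proof of Theorem 15. Set $\varepsilon_n := n^{1/2 - \varepsilon}$; we remind the reader that $0 < \varepsilon < 1/2$ and hence $\varepsilon_n \to \infty$ as $n \to \infty$. We begin with a truncation. Let

$$\hat{w}_{ij} := w_{ij} \mathbf{1}_{\{|w_{ij}| \le \varepsilon_n\}} \quad \text{for } 1 \le i \le j \le n.$$

We define the values $\mu_{ij} := \mathbb{E} \hat{w}_{ij}$ and $\tau_{ij}^2 := \mathbb{E}[w_{ij}^2] - \mathbb{E}[\hat{w}_{ij}^2]$ for $1 \le i \le j \le n$. Then by (9), we have

$$\sup_{1 \le i < j \le n} |\mu_{ij}| \le \frac{C_1}{\varepsilon_n^3}, \qquad \sup_{1 \le i \le n} |\mu_{ii}| \le \frac{\sigma^2}{\varepsilon_n}, \tag{20}$$

$$\sup_{1 \le i < j \le n} \tau_{ij}^2 \le \frac{C_1}{\varepsilon_n^2}, \qquad \sup_{1 \le i \le n} \tau_{ii}^2 \le \sigma^2. \tag{21}$$

For $1 \le i \le j \le n$ define the random variable $\tilde{w}_{ij}$ as a mixture of

• $\hat{w}_{ij}$ with probability $1 - \frac{|\mu_{ij}|}{\varepsilon_n} - \frac{\tau_{ij}^2}{\varepsilon_n^2}$ and

• $z_{ij}$ with probability $\frac{|\mu_{ij}|}{\varepsilon_n} + \frac{\tau_{ij}^2}{\varepsilon_n^2}$,

where $z_{ij}$, $1 \le i \le j \le n$, are independent Bernoulli random variables independent of $W_n$. Set $\tilde{w}_{ji} = \tilde{w}_{ij}$ for $1 \le i < j \le n$. Let $\tilde{W}_n = (\tilde{w}_{ij})_{1 \le i,j \le n}$ and $\tilde{M}_n = \frac{1}{\sqrt{n}} \tilde{W}_n$.

We now show that there exist Bernoulli random variables $z_{ij}$ such that $\tilde{M}_n$ is a real symmetric Wigner matrix that satisfies

$$\sup_{1 \le i \le j \le n} |\tilde{w}_{ij}| \le n^{1/2 - \varepsilon/2} \quad \text{almost surely} \tag{22}$$

and

$$\sup_{1 \le i < j \le n} \mathbb{E}[\tilde{w}_{ij}^4] \le 513\, C_1. \tag{23}$$

In particular, we will construct $z_{ij}$ to be a Bernoulli random variable, symmetric about its mean, such that its mean and second moment satisfy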
(24) = iHj (\ - 

(25) E[tu?.] = (E[v%] 
for 1 < i < j < n. We first note that, by definition of uiy, we only need to consider 

I I r 2 

the case when + 4f > 0. Suppose a^, are real numbers that satisfy 
= M « 



E[«;?-] = (E[to?.] 




From the first equation, we obtain 



(26) 





From the second equation, we have 

(27) b% ( M + J) = (EK] - 4) 
We now note that > £ n |^t,-j| by definition of u)y and hence 

It then follows that 

(28) 4 > '^ |2 t2 
Combining ([26|. ([27) . and ([28]). we obtain 

,2 ( lAHjj , T S | \ _2 ^ Vij s,„2 [ iMijj 



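and hence $b_{ij}^2 \ge a_{ij}^2$. We can now define

$$z_{ij} := \begin{cases} a_{ij} + \sqrt{b_{ij}^2 - a_{ij}^2} & \text{with probability } 1/2, \\ a_{ij} - \sqrt{b_{ij}^2 - a_{ij}^2} & \text{with probability } 1/2. \end{cases}$$

It is straightforward to verify that $z_{ij}$ has mean $a_{ij}$ and second moment $b_{ij}^2$.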
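For completeness, the verification can be written out. The sketch assumes the two-point form of $z_{ij}$, taking the values $a_{ij} \pm \sqrt{b_{ij}^2 - a_{ij}^2}$ with probability $1/2$ each, which is well defined because $b_{ij}^2 \ge a_{ij}^2$.

```latex
\begin{align*}
\mathbb{E}[z_{ij}] &= \tfrac{1}{2}\Bigl(a_{ij} + \sqrt{b_{ij}^2 - a_{ij}^2}\Bigr) + \tfrac{1}{2}\Bigl(a_{ij} - \sqrt{b_{ij}^2 - a_{ij}^2}\Bigr) = a_{ij}, \\
\mathbb{E}[z_{ij}^2] &= \tfrac{1}{2}\Bigl(a_{ij} + \sqrt{b_{ij}^2 - a_{ij}^2}\Bigr)^2 + \tfrac{1}{2}\Bigl(a_{ij} - \sqrt{b_{ij}^2 - a_{ij}^2}\Bigr)^2
 = a_{ij}^2 + \bigl(b_{ij}^2 - a_{ij}^2\bigr) = b_{ij}^2 .
\end{align*}
```

The cross terms cancel in the second line, and the two values are symmetric about the mean $a_{ij}$, as required.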
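By construction $\tilde{M}_n$ is a real symmetric Wigner matrix. We now verify (22) and (23). By solving equations (24) and (25) for $\mathbb{E}[z_{ij}]$ and $\mathbb{E}[z_{ij}^2]$ and applying the bounds (20) and (21), it follows that $|z_{ij}| \le 4\varepsilon_n$. Thus we conclude that (22) holds for $n$ sufficiently large.

We also have for $1 \le i < j \le n$

$$\mathbb{E}[\tilde{w}_{ij}^4] = \mathbb{E}[\hat{w}_{ij}^4] \left( 1 - \frac{|\mu_{ij}|}{\varepsilon_n} - \frac{\tau_{ij}^2}{\varepsilon_n^2} \right) + \mathbb{E}[z_{ij}^4] \left( \frac{|\mu_{ij}|}{\varepsilon_n} + \frac{\tau_{ij}^2}{\varepsilon_n^2} \right) \le C_1 + (4\varepsilon_n)^4 \frac{2 C_1}{\varepsilon_n^4} \le 513\, C_1$$

by (20) and (21). This verifies (23) and hence $\tilde{M}_n$ satisfies the conditions of Theorem 19.

By Theorem 19, there exists a constant $c > 0$ such that the event

$$\Omega_n := \bigcap_{j=1}^{n} \left\{ |\lambda_j(\tilde{M}_n) - \eta_j| \le (\log n)^{c \log\log n}\, n^{-2/3} \left[ \min\{j, n - j + 1\} \right]^{-1/3} \right\}$$

holds with overwhelming probability. Thus we obtain

$$\mathbb{P}\left( \exists j : |\lambda_j(M_n) - \eta_j| > (\log n)^{c \log\log n}\, n^{-2/3} \left[ \min\{j, n - j + 1\} \right]^{-1/3} \right) \le \mathbb{P}(\Omega_n^c) + \mathbb{P}(M_n \ne \tilde{M}_n).$$

The proof of Theorem 15 is now complete by noting that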



P(M„ ^ Af„) < ^ P(|u^| > e„) + ^ 



2 2 

-71 H E [«'« 1 {|«i«l>e»}]+ j"Z) E [ tt « 1 {|™«l>=»}] 

« l<i<i<n « i=l 



□

Appendix B. Proof of Lemma 16

In order to prove Lemma 16, we use the central limit theorem for martingale difference sequences.

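Theorem 20 (Theorem 35.12 of [B]). For each $N$, suppose $Z_{N1}, Z_{N2}, \ldots, Z_{N r_N}$ is a real martingale difference sequence with respect to the increasing $\sigma$-field $\{\mathcal{F}_{Nj}\}$ having second moments. If as $N \to \infty$,

$$\sum_{j=1}^{r_N} \mathbb{E}\left( Z_{Nj}^2 \mid \mathcal{F}_{N,j-1} \right) \longrightarrow v^2 \quad \text{in probability}, \tag{29}$$

where $v^2$ is a positive constant, and for each $\varepsilon > 0$,

$$\sum_{j=1}^{r_N} \mathbb{E}\left( Z_{Nj}^2 \mathbf{1}_{\{|Z_{Nj}| \ge \varepsilon\}} \right) \longrightarrow 0, \tag{30}$$

then

$$\sum_{j=1}^{r_N} Z_{Nj} \longrightarrow N(0, v^2) \quad \text{in distribution}.$$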

We will also need a number of computations, which we collect in the following 
lemma. 
Lemma 21 (Computations). For each $n \ge 1$, let $(\xi_1^{(n)}, \ldots, \xi_k^{(n)})$ be a discrete random sample on $[n] := \{1, 2, \ldots, n\}$, where $k = k(n)$ is a positive integer sequence. Define $\zeta_i^{(n)} := \xi_i^{(n)} / n$ for each $n \ge 1$ and $i = 1, 2, \ldots, k$. Let $g : [0, 1] \to \mathbb{R}$ be a bounded function. Then there exists a constant $C > 0$ (depending only on the function $g$) such that



(31) 



E 



E^io(C (n) ) 



< 



C 



n - j + 1 



and 



(32) 



E 



E,_ l5 ( C f) - Eff(CD 



< 



C 



n - j + 1 



Proof. We write 



E 



Ej-ifl(d B) )' 



g{s/n)g(t/n) 
^ (n - j + l) 2 



o(s/n)g(t/n) 

n(n - 1) • • • (n - j + 2) ^-f (n-j+ l) 2 



E 



E 



^ g{s/n)g(t/n 



E 



Z—i (n— i 



T[{n- J + 1) 

g(s/n)g(t/n 



E 



i 



E 



2 ^ n(n-l)---(n-j + 2) 

1 



E 



^ ("-^ + 1 ) 2 | S , =i fir., t(Efl »(» - 1) ■ ■ ■ (» - J + 2) 

1 EAd B) ) + n - j+2 



n — j + 1 

E 



ff(Ci (n) )ff(d n) ) 



n - j + 1 
+ 0/ 



E 



0(d n) Md n) ) 



n-j + lj' 



where the set S in the sums above is an ordered set. We now note that 



fl«$ B) Md B) )]=E 



g{s/n)g(t/n) 
n(n — 1) 



= ^ g(s/n)g(t/n) + Q J I 



s=tt 



J2 9(s/n)g(t/n) +0 Jl 



s,t=l 
Combining the estimates above yields (31). For (32), we write



E 



E,-i.9(C 



E 



E 



gfti/n) ■ ■■gjU/n) 
(n-j + l) 4 



E 



E 



g(tt/n) ■ ■■g{t A /n) 



n{n- l)-..(n -j + 2) ^ (n - j + l) 4 



E 



g(ti/n) ■ ■■g{t i /n) 



E 



*,...*=! (n - J + 1)4 \s^-£,...MiS n{n-l)--{n-j + 2Y 

where the set $S$ in the sums above is an ordered set. We now consider several cases where $t_1, \ldots, t_4$ are not distinct.

(1) When the first sum is over $t_1 = t_2$ and $t_1, t_3, t_4$ are distinct, we obtain
g(ti/n) ■ ■ ■ g(t 4 /n) ^ 1 



E 



( n -j + iy 



E 



|S|=j-l;ti,t 3 ,*4^S 



i{n - 1) • • • (n - j + 2) 



E 

£ 1 i*3>*4 



g(h/n) 2 g(t 3 /n)g(U/n) (re - 3) ■ ■ ■ (re - j - 1) 
(n - j + l) 4 n(n - 1) • • • (n - i + 2) 



1 



{n-j)(n-j - 1) 
(n - j + l) 3 



v n — j + 1 / 

(2) When the sum is over $t_1 = t_2 = t_3 \ne t_4$, we have

g{tx/nfg{U/n) ^ 1 



E 



E 



E 



9(C («) ) 3 p(C («) ) l ("-J + D(n-J) 



(n-j+ iy 



1 



M (n-j + l) 2 
(3) When the sum is over $t_1 = t_2 = t_3 = t_4$, we obtain



E 



g(hH 4 



E 



l 



^(n-j + l) 4 ^ n(n-l)---(n-j + 2) 



ff (d n) ) 4 



(n-j + 1) 3 



(n-j + l) 3 
Combining the above bounds yields



= E 



g(ti/ri) ■ ■■g(t 4 ,/n) 



E 



(n- j + l) 4 ^ n(n - 1) • • • (n - j + 2) 

ti,...,t 4 distinct v J ; \S\=j-l;t 1 ,...,t i £S 



1 



- j + 1, 

When $t_1, \ldots, t_4$ are distinct, we can compute the inside sum and obtain



E 



%-is(C 



(n). 



E 



E 



:9(d n) )---5(CD 

1 



(n - j)(n - j - l)(n - j - 2) 
(n - i + l) 3 







n — j + 1 



5(d n) )---3(cD 



(n)s 



Lastly, we note that 



s(d w) )-fl(d B) ) 



E 

ii ,...,£4 distinct 



o 



n-j + l 



g(h/n) ■ ■■g(U/n) 



n(n-l)(n-2)(n-3) 



E 

ti ,...,i4 distinct 



#(ii/n) • ■■g(U/n) 



E 

ti,...,t 4 =i 



g{tx/n) ■ ■■g{t i /n) 



O g [- 



O a 



= [%(Ci (n) / 

and the proof of Lemma 21 is complete.

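Proof of Lemma 16. We will use Theorem 20 to prove Lemma 16. We write

$$a_{n,k} \sum_{i=1}^{k} \left[ f(\zeta_i^{(n)}) - \mathbb{E} f(\zeta_i^{(n)}) \right] = \sum_{j=1}^{k} Z_{nj},$$

where

$$Z_{nj} := a_{n,k} \sum_{i=1}^{k} \left[ \mathbb{E}_j f(\zeta_i^{(n)}) - \mathbb{E}_{j-1} f(\zeta_i^{(n)}) \right],$$

$\mathbb{E}_j$ denotes expectation with respect to the $\sigma$-algebra $\mathcal{F}_{n,j}$, and $\mathcal{F}_{n,j} = \sigma(\zeta_1^{(n)}, \ldots, \zeta_j^{(n)})$. By considering the cases when $i < j$, $i = j$, and $i > j$, we have that

$$Z_{nj} = a_{n,k} \left[ f(\zeta_j^{(n)}) - \mathbb{E}_{j-1} f(\zeta_j^{(n)}) + (k - j) \left( \mathbb{E}_j f(\zeta_k^{(n)}) - \mathbb{E}_{j-1} f(\zeta_k^{(n)}) \right) \right].$$

We now compute

$$\mathbb{E}_j f(\zeta_k^{(n)}) = \frac{1}{n - j} \sum_{t \notin \{\xi_1^{(n)}, \ldots, \xi_j^{(n)}\}} f(t/n) = \frac{n - j + 1}{n - j}\, \mathbb{E}_{j-1} f(\zeta_j^{(n)}) - \frac{f(\zeta_j^{(n)})}{n - j}.$$

Thus,

$$Z_{nj} = a_{n,k}\, \frac{n - k}{n - j} \left[ f(\zeta_j^{(n)}) - \mathbb{E}_{j-1} f(\zeta_j^{(n)}) \right].$$

Since $f$ is bounded and $a_{n,k} = o(1)$ it follows that $Z_{nj} = o(1)$ uniformly for $j = 1, 2, \ldots, k$. So the events $\{|Z_{nj}| \ge \varepsilon\}$ are empty for $n$ sufficiently large. Thus (30) holds.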
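The simplification of $Z_{nj}$ rests on a short identity for sampling without replacement. The sketch below spells it out for $j < k$, assuming the sample is drawn without replacement; the notation $R_j$ for the set of indices in $[n]$ not yet drawn after $j$ draws is introduced here only for this illustration.

```latex
\begin{align*}
\mathbb{E}_{j-1} f(\zeta_k^{(n)}) &= \mathbb{E}_{j-1} f(\zeta_j^{(n)}) = \frac{1}{n-j+1} \sum_{t \in R_{j-1}} f(t/n), \\
\mathbb{E}_{j} f(\zeta_k^{(n)}) &= \frac{1}{n-j} \sum_{t \in R_{j}} f(t/n)
  = \frac{(n-j+1)\, \mathbb{E}_{j-1} f(\zeta_j^{(n)}) - f(\zeta_j^{(n)})}{n-j}, \\
\mathbb{E}_{j} f(\zeta_k^{(n)}) - \mathbb{E}_{j-1} f(\zeta_k^{(n)}) &= \frac{\mathbb{E}_{j-1} f(\zeta_j^{(n)}) - f(\zeta_j^{(n)})}{n-j}, \\
1 - \frac{k-j}{n-j} &= \frac{n-k}{n-j}.
\end{align*}
```

Substituting the third line into the expression for $Z_{nj}$ and using the last line gives the factor $(n-k)/(n-j)$ in front of $f(\zeta_j^{(n)}) - \mathbb{E}_{j-1} f(\zeta_j^{(n)})$.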

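We now verify (29) and compute the limiting variance. We note that

$$\sum_{j=1}^{k} \mathbb{E}_{j-1}[Z_{nj}^2] = a_{n,k}^2 (n - k)^2 \sum_{j=1}^{k} \frac{1}{(n - j)^2} \left[ \mathbb{E}_{j-1} f^2(\zeta_j^{(n)}) - \left( \mathbb{E}_{j-1} f(\zeta_j^{(n)}) \right)^2 \right]. \tag{33}$$

We will show that

$$a_{n,k}^2 (n - k)^2\, \mathbb{E} \left| \sum_{j=1}^{k} \frac{1}{(n - j)^2} \left[ \mathbb{E}_{j-1} f^2(\zeta_j^{(n)}) - \mathbb{E} f^2(\zeta_1^{(n)}) \right] \right| \longrightarrow 0 \tag{34}$$

and

$$a_{n,k}^2 (n - k)^2\, \mathbb{E} \left| \sum_{j=1}^{k} \frac{1}{(n - j)^2} \left[ \left( \mathbb{E}_{j-1} f(\zeta_j^{(n)}) \right)^2 - \left( \mathbb{E} f(\zeta_1^{(n)}) \right)^2 \right] \right| \longrightarrow 0 \tag{35}$$

as $n \to \infty$.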

For (jM)) . it suffices to prove that 



0. 



By Lemma we have that 



^- fc ) 2 E(^-jW z 



an,fc(«- fc ) 2 E 



E,-_!/2(c] n) ) - E/ 2 (Cl n) ) 



<< fc (n-fc) 2 £. V( " 



3 E^i/^d" 3 )! - fE/2( C ("))' 



(n - j) 2 - 5 



\/~C 1 

" nM (n-j) 2 



< 



C 



\/n — k 



(n- l)(n- jfe) 



0. 



PARTIAL LINEAR. EIGENVALUE STATISTICS 






Here the last inequality comes from a comparison argument between (n-j)' 2 
and an appropriate integral. This verifies (|34p . The proof of (|35p is similar and 
uses d^- 
Using (34) and (35), we have that

$$\sum_{j=1}^{k} \mathbb{E}_{j-1}[Z_{nj}^2] - \operatorname{Var}[f(\zeta_1^{(n)})]\, a_{n,k}^2 (n - k)^2 \sum_{j=1}^{k} \frac{1}{(n - j)^2} \longrightarrow 0$$

in probability as $n \to \infty$. However, a comparison argument verifies that

$$\lim_{n \to \infty} a_{n,k}^2 (n - k)^2 \sum_{j=1}^{k} \frac{1}{(n - j)^2} = 1$$

and the proof of Lemma 16 is complete. □
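The final limit can be checked numerically. The sketch below assumes the normalization $a_{n,k}^2 = n / (k(n-k))$ (an assumed form of (7)) and evaluates the scaled sum for a few pairs $(n, k)$ with $\min\{k, n-k\}$ large; the function name is hypothetical.

```python
def scaled_harmonic_sum(n, k):
    """Return a_{n,k}^2 (n-k)^2 sum_{j=1}^k 1/(n-j)^2, assuming a_{n,k}^2 = n/(k(n-k))."""
    a_sq = n / (k * (n - k))
    total = sum(1.0 / (n - j) ** 2 for j in range(1, k + 1))
    return a_sq * (n - k) ** 2 * total

# Three regimes: k much smaller than n, k proportional to n, n - k much smaller than n.
for n, k in [(10**4, 100), (10**5, 5 * 10**4), (10**6, 10**6 - 1000)]:
    print(n, k, scaled_harmonic_sum(n, k))
```

The comparison argument in the text amounts to replacing the sum by the integral of $x^{-2}$ over $[n-k, n]$, which equals $k / (n(n-k))$ and cancels the prefactor exactly.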